Development of a southern Swedish clustergen voice for speech synthesis

Author

  • Johan Frid
Abstract

This paper describes the development of a speech synthesis voice with a southern Swedish accent. The voice is built for the Festival speech synthesis system using the tools in the festvox suite. The voice type is clustergen, a statistical parametric synthesis method in which parametric models for phonemes, duration and pitch are all built from a labeled speech database.

Similar resources

CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling

Unit selection synthesis has shown itself to be capable of producing high quality natural sounding synthetic speech when constructed from large databases of well-recorded, well-labeled speech. However, the cost in time and expertise of building such voices is still too expensive and specialized to be able to build individual voices for everyone. The quality in unit selection synthesis is direct...

Adapting the Filibuster text-to-speech system for Norwegian bokmål

The Filibuster text-to-speech system is specifically designed and developed for the production of digital talking textbooks at university level for students with print impairments. Currently, the system has one Swedish voice, 'Folke', which has been used in production at the Swedish Library of Talking Books and Braille (TPB) since 2007. In August 2008 the development of a Norwegian voice (bokmå...

Optimizations and fitting procedures for the liljencrants-fant model for statistical parametric speech synthesis

Every parametric speech synthesizer requires a good excitation model to produce speech that sounds natural. In this paper, we describe efforts toward building one such model using the Liljencrants-Fant (LF) model. We used the Iterative Adaptive Inverse Filtering technique to derive an initial estimate of the glottal flow derivative (GFD). Candidate pitch periods in the estimated GFD were then l...

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One interesting topic in the multimedia domain is enabling computers to produce speech. Speech synthesis grants the computer the human ability to produce speech. The data-based approach and the process-based approach are the two main approaches to speech synthesis. Each approach has its own challenges. Unit-selection speech synthesis and statistical parametr...

Voice source properties of the speech code

This is an outline of the knowledge we need in order to include the voice source in an advanced model of speech production with applications to text-to-speech rules. Recent results from studies of the Swedish language provide information of source properties and source-vocal tract interaction as a function of the segmental and prosodic frame within an utterance and with reference to aerodynamic...

Publication date: 2008